Temporal difference learning - PDFSEARCH.IO - Document Search Engine

Temporal difference learning
Results: 95

#	Item
11	Evolutionary Feature Evaluation for Online Reinforcement Learning Add to Reading List Source URL: eldar.mathstat.uoguelph.ca Language: English - Date: 2016-07-12 12:05:04 Cognitive science Cognition Artificial intelligence Machine learning Belief revision Reinforcement learning Temporal difference learning Q-learning Feature selection Supervised learning Proto-value functions Action selection
12	Continuous Deep Q-Learning with Model-based Acceleration arXiv:1603.00748v1 [cs.LG] 2 Mar 2016 Shixiang Gu1 2 3 SG 717@ CAM . AC . UK Add to Reading List Source URL: arxiv.org Language: English - Date: 2016-03-02 20:31:58 Artificial intelligence Machine learning Computational neuroscience Learning Applied mathematics Artificial neural network Mathematical psychology Q-learning Reinforcement learning Supervised learning Feature learning Temporal difference learning
13	AITF ANNUAL REPORT 2016 DR. RICHARD SUTTON REINFORCEMENT LEARNING AND ARTIFICIAL INTELLIGENCE AITF ANNUAL REPORT MARCH 31, EXECUTIVE SUMMARY Add to Reading List Source URL: webdocs.cs.ualberta.ca Language: English - Date: 2016-05-16 19:35:21 Computational neuroscience Applied mathematics Cybernetics Cognitive science Neuroscience Belief revision Reinforcement learning Machine learning Artificial neural network Temporal difference learning Deep learning Reinforcement
14	fourteen declarative principles of experience-oriented intelligence 1. all goals and purposes can be well thought of as the maximization of the expected value of the cumulative sum of a single externally received number Add to Reading List Source URL: webdocs.cs.ualberta.ca Language: English - Date: 2009-03-27 16:18:08 Education Educational psychology Computational neuroscience Cognition Cognitive science Futurology Prediction Scientific method Learning Temporal difference learning Scientific modelling Educational technology
15	GQ(λ): A general gradient algorithm for temporal-difference prediction learning with eligibility traces Hamid Reza Maei and Richard S. Sutton Reinforcement Learning and Artificial Intelligence Laboratory, University of Add to Reading List Source URL: webdocs.cs.ualberta.ca Language: English - Date: 2010-01-22 02:08:08 Smooth functions Distribution Functional analysis Universal property
16	Sutton, Richard PIN Add to Reading List Source URL: webdocs.cs.ualberta.ca Language: English - Date: 2013-10-18 16:05:54 Computational neuroscience Belief revision Reinforcement learning Computational statistics Q-learning Temporal difference learning Artificial neural network Machine learning Markov decision process Mathematical optimization Algorithm Gradient descent
17	Natural Temporal Difference Learning Add to Reading List Source URL: psthomas.com Language: English - Date: 2014-11-06 09:18:20 Smooth functions Distribution Functional analysis Reinforcement learning Linear temporal logic
18	An Emphatic Approach to the Problem of Off-policy Temporal-Difference Learning arXiv:1503.04269v1 [cs.LG] 14 MarRichard S. Sutton Add to Reading List Source URL: arxiv.org Language: English - Date: 2015-03-16 20:16:49 Algebra Linear algebra Mathematics Markov models Markov processes Matrix theory Matrices Q-learning Markov chain Matrix Reinforcement learning Temporal difference learning
19	Playing Atari with Deep Reinforcement Learning Volodymyr Mnih Koray Kavukcuoglu Add to Reading List Source URL: arxiv.org Language: English - Date: 2013-12-19 20:23:45 Artificial intelligence Computational neuroscience Machine learning Learning Artificial neural networks Cybernetics Q-learning Reinforcement learning Deep learning Markov decision process Feature learning Temporal difference learning
20	Value Learning and Arousal in the Extinction of Probabilistic Rewards: The Role of Dopamine in a Modified Temporal Difference Model Minryung R. Song1, Jean-Marc Fellous2,3,4* 1 Department of Bio and Brain Engineering, Ko Add to Reading List Source URL: amygdala.psychdept.arizona.edu Language: English - Date: 2014-06-10 21:21:48